Llama 3.1 Nemotron Nano 4B V1.1
Other
Llama-3.1-Nemotron-Nano-4B-v1.1 is a large language model derived from Llama 3.1 8B through compression, optimized for inference efficiency and task execution, suitable for local deployment on a single RTX GPU.
Large Language Model
Transformers English